JPEG 2000-like access using the JPM compound document file format

Authors

  • Martin P. Boliek
  • Gene K. Wu
Abstract

Compound document images are usually high-resolution, high-quality images that include color, graphics, and images in addition to text. Good compression is therefore important for storage and transmission. Because of their large size, even when compressed, it is often difficult to access document images quickly and efficiently for display on monitors. The nascent JPM file format enables the best compound document image compression in terms of rate-distortion. Although JPM allows the use of JPEG 2000, access into a JPM file is limited by the access features of the older coders used. JPEG 2000 is an image coding system that allows access to lower resolutions, progressive bit-rates, and regions of interest. This paper describes methods for using JPEG 2000 in conjunction with older binary coders in a JPM file. Using these techniques, it is possible to get close to the best rate-distortion performance and still have access into the file.

1 INTRODUCTION

Documents often include text, graphics, and imagery. Magazines, newspapers, brochures, and annual reports have had these attributes for a long time. With the popularity of desktop publishing, color scanners, color printers, color copiers, and color digital cameras for the consumer and office markets, the ability to make use of color, graphics, and imagery in documents is now commonplace. These documents are often reused across media modalities such as print and the World Wide Web. There is a need for different and interactive access into the document format.

Traditional facsimile compression technologies (G3, G4, MMR, JBIG) are insufficient for color images. Baseline JPEG compression sacrifices quality and is inefficient for the sharp edges created by text. Furthermore, none of these allows access to lower resolutions, progression from lossy to lossless, or access to regions of interest. Such access is useful for delivering document images from databases or capture devices to different target devices, such as computer and PDA displays and printers.

Two new standard technologies are designed to address these problems: the nascent JPM file format (JPEG 2000 Mixed Raster Content) [1] and the JPEG 2000 coding system [2][8].

1.1 JPM file format technology

The JPM file format is the latest descendant of a number of innovative technologies. Mixed Raster Content was standardized as ITU-T Rec. T.44 [3]. This standard is used in the IETF facsimile standard [9] and Xerox's Digipaper product [10]. Another related technology that preceded JPM is DjVu [11], which uses wavelet technology for the continuous-tone …


Similar resources

SmartNails: display- and image-dependent thumbnails

To overcome the poor readability of text and the poor recognizability of image features in low-resolution thumbnails, a novel representation of compound document images, the SmartNail, is presented. SmartNails are replacements or supplements to traditional thumbnails for compound documents and contain cropped and scaled image and text segments. Image-based analysis and text-based a...


From TIFF to JPEG 2000? Preservation Planning at the Bavarian State Library Using a Collection of Digitized 16th Century Printings

Studies and user reports claim JPEG 2000 to be – or at least will become – the next archiving format for digital images [1]. The format offers new possibilities, such as streaming, and reduces storage consumption through lossless and lossy compression [2]. Another often claimed advantage of JPEG 2000 is that the master image can possibly serve as the access copy as well, and thus replace derive...


Segmentation and compression of documents with JPEG2000

We review the standard JPEG2000 for still image compression and mention some typical applications. Special weight is put onto the core coding system described in Part 1 and the compound image file format for document imaging described in Part 6 including a section on image segmentation. Index Terms — JPEG2000, still image compression, mixed raster graphics, segmentation


Assessing the OCR Degradation in the Generation of JPEG, PNG, and TIFF Files from Adobe PDF

Adobe Portable Document Format is the de facto standard today due to its widespread use. One of the features of Adobe PDF is that it allows exporting documents as images that may be saved in JPEG, PNG, and TIFF formats. This paper uses an OCR platform to quantitatively assess the quality of these image file


Icon based error concealment for JPEG and JPEG 2000 images

This paper describes methods to recover the useful data in JPEG and JPEG 2000 compressed images and to estimate data for those portions of the image where correct data cannot be recovered. These techniques are designed to handle the loss of hundreds of bytes in the file. No use is made of restart markers or other optional error detection features of JPEG and JPEG 2000, but an uncorrupted low re...



Publication date: 2003